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Abstract of th disclosur 

Fusion proteins with iissunoglobulln portions, the 
preparation and use thereof 

The invention relates to genetically engineered soluble 
fusion proteins composed of human proteins not belonging 
to the immunoglobulin family, or of parts thereof, and of 
various portions of the constant region of Immunoglobulin 
molecules. The functional properties of the two fusion 
partners are surprisingly retained in the fusion protein. 



Behring. rke Akti ngea llschaft HOE 90/B 026 - Ha 824 

and Dr. LP 

The General Hospital Corporati n 2045869 
Description 

Fusion proteins with immunoglobulin portions, the 
preparation and use thereof 



The invention relates to genetically engineered soluble 
fusion proteins composed of human proteins not belonging 
to the immunoglobulin family, or of parts thereof, and of 
various portions of the constant region of immunoglobulin 
moleculeo. The functional properties of the two fusion 
partners are, surprisingly, retained in the fusion 
protein. 

EP-A 0 325 262 and BP-A 0 314 317 disclose corresponding 
fusion proteins composed of various domains of the CD4 
membrane protein of human T cells and of h uman IgGl 
portions. Some of these fusion proteins bind with the 
same affinity to the glycoprotein gpl20 of human immuno- 
deficiency virus as the cell-bound CD4 molecule. The CD4 
molecule belongs to the immunoglobulin family and, 
consequently, has a very similar tertiary structure to 
that of immunoglobulin molecules. This also applies to 
the a chain of the T-cell antigen receptor, for which 
such fusions have also been described (Gascoigne et al., 
Proc. Natl. Acad. Sci. USA, vol. 84 (1987), 2937-2940). 
Hence, on the basis of the very similar domain structure, 
in this case retention of the biological activity of the 
two fusion partners in the fusion protein was to be 
expected . 

The human proteins which are, according to the invention, 
pr f rably c upl d to th amino terminus f th c nstant 
r gion f immunoglobulin d n t belong t th immuno- 
globulin family and ar to be assign d to the following 
class st (i) membrane-bound proteins whos extrac llular 
domain is wh lly or partly incorporated in th fusion. 
Th s are, in particular, thromboplastin and cytokin 
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r c ptors and growth factor r c ptora, such as th 
c llular rec ptorc for int rl ukin-4, int rl ukin-7, 
tumor necrosis factor, GM-CSF, G-CSF, erythropoietin; 
(ii) non-membrane-bound soluble proteins which are wholly 
or partly incorporated in the fusion. These are, in 
particularly, proteins of therapeutic interest such as, 
for example, erythropoietin and other cytokines and 
growth factors. 

The fusion proteins can be prepared in known pro- and 
eukaryotic expression systems, but preferably in mammal- 
ian cells (for example CHO, COS and BHK cells). 

The fusion proteins according to the invention are, by 
reason of their immunoglobulin portion, easy to purify by 
affinity chromatography and have improved pharmacokinetic 
properties in vivo. 



In many cases, the Fc part in fusion protein is 
thoroughly advantageous for use in therapy and diagnosis 
and thus results, for example, in improved pharma- 
cokinetic properties (EP-A 0232 262). On the other hand, 
for some uses it would be desirable to be able to delete 
the Fc part after the fusion protein has been expressed, 
detected and purified in the advantageous manner 
described. This is the case when the Fc portion proves to 
be a hindrance to use in therapy and diagnosis, for 
example when the fusion protein is to be used as antigen 
for immunisations. 

There are in existence various proteases whose use for 
this purpose appears conceivable. Papain and pepsin are 
employed, for example, to generate F(ab) fragments from 
immunoglobulins (Immunology, ed. Roitt, I. et al., Gower 
Medical Publishing, London (1989)), but th y do not 
cl *v In a particularly specific manner. Blood coagula- 
tion factor Xa by contrast recognis s in a pr t in th 
r lativ ly rar t trapeptid sequ nc II -Glu-Gly-Arg and 
performs a hydrolytic cl avaqe of th pr tein after the 
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arginin r sidu . S quenc s which c ntain th 
d scribed tetrapeptid wor introduc d first by Nagai and 
Thogersen in a hybrid protein by genetic engineering 
means (Nagai, K. and Thogersen, H.C., Nature, vol. 309 
5 (1984), 810-812). These authors were able to show that 

the proteins expressed in B. coll actually are specifi- 
cally cleaved by factor Za. However, there is as yet no 
published example of the possibility of such proteins 
also being expressed in eukaryot-lc and, especially, in 

10 animal cells and, after their purification, being cleaved 
by factor Xa. However, expression of the proteins 
according to the invention in animal cells is preferable 
because only in a cell system of this type is there 
expected to be secretion of, for example, normally 

15 membrane-bound receptors as fusion partners with 
retention of their natural structure and thus of their 
biological activity. Secretion into the cell culture 
supernatant facilitates the subsequent straightforward 
purification of the fusion protein. 

20 The invention thus relates to genetically engineered 
soluble fusion proteins composed of human proteins not 
belonging to the immunoglobulin family, or of parts 
thereof, and of various portions of the constant regions 
of heavy or light chains of immunoglobulins 'of various 

25 subclasses (IgG, XgM, IgA» IgB) . Preferred as immuno- 
globulin is the constant part of the heavy chain of human 
IgG, particularly preferably of human IgGl, where fusion 
takes place at the hinge region. In a particular embodi- 
ment, the Pc part can be removed in a simple way by a 

30 cleavage sequence which is also incorporated and can be 
cleaved with factor Za. 



Furth rmor , th inv ntion relat s to proc ss s for th 
pr paration f th s fusion proteins by g n tic ngineer- 
ing, and to th us th reof for diagnosis and th rapy. 



10 
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The invention will now be described in relation to th 
drawings, in which: 

Figure 1 shows two oligonucleotide probe molecules 
used in cloning of thromboplastin cDNA; 

Figure 2 shows the nucleotide sequence of clone 2b- 
Apr5 with the thromboplastin amino acid sequence deduced 
therefrom; 

Figure 3 shows two oligonucleotide sequences which 
are partially homologous with the sequence of the coding 
strand (A), and with the non-coding strand (B) of 
thromboplastin cDNA; 

Figure 4 shows the restriction map of plasmid 

pTFlFc; 

15 Figure 5 shows two oligonucleotide sequences which 

are partially homologous with the sequence of the coding 
strand (A), and with the non-coding strand (B) of the IL-4 
receptor cDNA cloned in the vector pDC302/T22-8; 

Figure 6 shows the restriction map of plasmid 

!0 pIL4RFc; 

Figure 7 shows two oligonucleotide sequences A and 
B which are partially homologous with the sequence of the 
coding strand (A), and with the non-coding strand (B) of the 
EPO cDNA cloned in the vector pCES; and 
5 Figure 8 shows the restriction map of plasmid 

pEPOFc . 



Finally, th invention is xplained in further examples. 



- 4 - 



2045869 



Examp le It Thromboplastin fuaion proteins 

Blood coagulation is a proc ss of c ntral importance in 
the human body. There is appropriately delicate regula- 
tion of the coagulation cascade, in which a large number 
of cellular factors and plasma proteins cooperate. These 
proteins (and their cof actors) in their entirety are 
called coagulation factors. The final products of the 
coagulation cascade are thrombin, which induces the 
aggregation of blood platelets, and fibrin which stabil- 
izes the platelet thrombus. Thrombin catalyzes the 
formation of fibrin from fibrinogen and itself is formed 
by limited proteolysis of prothrombin. Activated factor 
X (factor Xa) is responsible for this step and, in the 
presence of factor Va and calcium ions, binds to platelet 
membranes and cleaves prothrombin. 

Two ways exist for factor X to be activated, the extrin- 
sic and the intrinsic pathway. In the intrinsic pathway 
a series of factors is activated by proteolysis in order 
for each of them to form active proteases. In the extrin- 
sic pathway, there is increased synthesis of thrombo- 
plastin (tissue factor) by damaged cells, and it acti- 
vates factor X, together with factor Vila and calcium 
ions. It was formerly assumed that the activity of 
thromboplastin is confined to this reaction. However, the 
thromboplastin/VTIa complex also intervenes to activate 
the intrinsic pathway at the level of factor IX. Thus, a 
thromboplastin/VIIa complex is one of the most important 
physiological activators of blood coagulation. 

It is therefore conceivable that thromboplastin, apart 
from its use as diagnostic aid (see below), can also be 
employed as constitu nt of therapeutic ag nts f r treat- 
ing inborn or acquir d blood coagulation d ficiencies. 
Exampl 8 of this ar chronic hemophilias caus d by a 
d fici ncy of factors VIII, IX or XI or Is acute 
disturbanc a of blood coagulation as a cons qu nee of, 
for xampl , liver or kidney dis as . Us of such a 



th rapeutic ag nt aft r aurgicial int rv ntion would also 
be cone ivabl . 

Thromboplastin is an integral membrane protein which does 
not belong to the immunoglobulin family. Thromboplastin 
cDNA sequences have been published by a total of four 
groups (Fisher et al., Thromb. Res./ vol. 48 (1987), 
89-99; Morrisey et al., Cell, vol. 50 (1987), 129-135; 
Scarpa ti et al.. Biochemistry, vol. 26 (1987), 5234-5238; 
Spicer et al., Proc. Natl. Acad. Sci. USA, vol. 84 
(1937), 5148-5152). Thromboplastin cDNA contains an open 
reading frame which codes for a polypeptide of 295 amlno- 
acid residues, of which the 32 N-terminal amino acids act 
as signal peptide. Mature thromboplastin comprises 
263 amino-acid residues and has a three-domain structure} 
i) amino-terminal extracellular domain (219 amino-acid 
residues); ii) transmembrane region (23 amino-acid 
residues); iii) cytoplasmic domain (carboxyl terminus; 
21 amino-acid residues). In the extracellular domain 
there are three potential sites for N-glycosylation 
(Asn-X-Thr) . Thromboplastin is normally glycosylated but 
glycosylatlon does not appear essential for the activity 
of the protein (Paborsky et al.. Biochemistry, vol. 29 
(1989), 8072-8077). 

Thromboplastin is required as additive to plasma samples 
in diagnostic tests of coagulation. The coagulation 
status of the tested person can be found by the one-stage 
prothrombin clotting time determination (for example 
Quick's test). The thromboplastin required for diagnostic 
tests is currently obtained from human tissue, and the 
preparation process is difficult to standardize, the 
yield is low and considerable amounts of human starting 
material (placentae) must be supplied. On the other hand, 
it is to be xpected that pr paration of native, 
membrane-bound thromboplastin by g n tic ngineering will 
also be difficult owing to complex purification proc 8- 
8 8. Th ae difficult! s can be avoided by th fusion 
according to th inv ntion to immunoglobulin portions. 
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Th thromboplastin fusion proteins according to the 
invention ar s cr t d by mammalian c lis (for xampl 
CHO, BHK, COS c lis) into the cultur medium, purified by 
affinity chromatography on prot in A-Sepharoae and have 
surprisingly high activity in the one-stage prothrombin 
clotting time determination. 

Cloning of thromboplastin cDNA 

The sequence published by Scarpati et al . , Biochemistry, 
vol. 26 (1987), 5234-5238, was used for cloning the 
thromboplastin cDNA. Two oligonucleotide probe molecules 
(see Fig. 1) were derived from this. These two probe 
molecules were used to screen a cDNA bank from human 
placenta (Grundmann et al., Proc. Natl. Acad. Sci. USA, 
vol. 83 (1986), 8024-8028). 

cDNA clones of various lengths were obtained. One clone, 
2b-Apr5, which is used for the subsequent procedure, 
codes for the same amino-acid sequence as the cDNA 
described in Scarpati et al. Fig. 2 depicts the total 
sequence of the clone 2b-Apr5 with the thromboplastin 
amino-acid sequence deduced therefrom. 

Construction of a hybrid plasmid pTFlFc coding for 
thromboplastin fusion protein. 

The plasmid pCD4B gamma 1 (BP 0 325 262 A2; deposited at 
the ATCC under the number No. 67610) is used for 
expression of a fusion protein composed of human CD4 
receptor and human IgGl. The DNA sequence coding for the 
extracellular domain of CD4 is deleted from this plasmid 
using the restriction enzymes Hindi I I and BamHI. Only 
partial cleavage must be carried out with the enzyme 
Hindi I I in this cas , in order to cut at only on of the 
two Hindi I I sit s c ntained in pCD4B gamma 1 (position 
2198). The r suit is an opened vector in which a ukary- 
otic transcription r gulation s qu nc (promoter) is 
followed by the open Hindi 1 1 sit . Th open BamHI sit is 
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located at th start of th coding r gions for a penta- 
peptido link r, followed by the hing and the CH2 and CH3 
domains of human IgGl. The reading frame in th BamHI 
recognition a gu nc GGATCC ia auch that GAT is trans- 
lated as aspartic acid. DMA amplification with 
thermostable ONA polymerase makes it possible to modify 
a given sequence in such a way that any desired sequences 
are attached at one or both ends. Two oligonucleotides 
able to hybridise with sequences in the 5 ' -untranslated 

region (Ax 5' GATCGATTAAGCTTCGGAACCCGCTCGATCTCGCCGCC 3') 
or 

coding region 

(Bt 5' GCATATCTGGATCCCCGTAGAATATTTCTCTGAATTCCCC 3') of 
thromboplastin cDNA were synthesized. Of these, oligo- 
nucleotide A is partially homologous with the sequence of 
the coding strand, and oligonucleotide B is partially 
homologous with the non-coding strand; cf. Pig. 3. 

Thus, amplification results in a ONA fragment (827 bp) 
which contains (based on the coding strand) at the 5* end 
before the start of the coding sequence a Hindi I I site, 
and at the 3' end after the codon for the first three 
ami no -acid residues of the transmembrane region a BamHI 
site. The reading frame in the BamHI cleavage site is 
such that ligation with the BamHI site in pCD4B 
results in a gene fusion with a reading frame continuous 
from the initiation codon of the thromboplastin cDNA to 
the stop codon of the heavy chain of IgGl. The desired 
fragment was obtained and, after treatment with Hindi I I 
and BamHI, ligated into the vector pCD4B g amma 1, as 
described above, which had been cut with Hindi I I 
(partially) and BamHI. The resulting plasmld was called 
pTFIPe (Pig. 4). 

Trans feet ion of pTFIPe into mammalian cells 



Th fusion prot in ncod d by th plasmid pTFIPe is 
call d pTPIPe h r inaft r. pTPlFc was transiently 
expr ss d in COS c lis. For this purpose, COS c 11b wer 



2045 

trans f acted with pTPIPe with the aid of DEAE-d xtran 
(BP A 0 325 262). Indirect immunofluor scence investiga- 
tions revealed that the proportion of trans fee ted cells 
was about 25 %. 24 h after tranafection, the cells were 
transferred into serum- free medium. This cell supernatant 
was harvested after a further three days. 

Purification of pTPIPe fusion protein fxosi cell culture 
Bupernatants 



170 ml of supernatant from transiently trans fee ted COS 
cells were collected overnight In a batch process in a 
column containing 0.8 ml of protein A-Sepharose at 4*c, 
washed with 10 volumes of washing buffer (50 mM tris 
buffer pH 8.6, 150 mM NaCl) and eluted in 0.5 ml frac- 
tions with eluting buffer (93i7 100 mM citric acid« 
100 mM sodium citrate). The first 9 fractions were 
immediately neutralized with 0.1 ml of 2M tris buffer 
pH 8.6 in each case and then combined, and the resulting 
protein was transferred by three concentration/dilution 
cycles in an Amicon mlcroconcentrator (Centricon 30) into 
TNE buffer (50 mM tris buffer pH 7.4, 50 mM HaCl, 1 mM 
EDTA). The pTPIPe obtained in this way is pure by 
SDS-PAGE electrophoresis (U.K. Lammli, Nature 227 (1970) 
680-685). In the absence of reducing agents it behaves in 
the SDS-PAGE like a dimer (about 165 KDa) . 

Biological activity of purified TFlPc in tne prothro mb in 
clotting time determination 



TPlPc fusion protein is active in low concentrations 
(> 50 ng/ml) in the one-stage prothrombin clotting time 
determination (Vinaiser, H. Gerinnungsphysiologie und 
Method n im Blutg rinnungslabor (1979), Pish r V rlag 
Stuttgart). Th cl tting times achiev d are comparabl 
with th clotting times obtained with thromboplastin 
isolat d from human placenta. 
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Example 2 : Interleukln-4 receptor fusion proteins 

Int rl ukin-4 (IL-4) is synth sized by T c lis and was 
originally called B-cell growth factor because it is able 
to stimulate B-cell proliferation. It exerts a large 
5 number of effects on these cells. One in particular is 

the stimulation of synthesis of molecules of immuno- 
globulin subclasses IgGl and IgB in activated B cells 
(Coffmann et al., Immunol. Rev., vol. 102 (1988) 5). In 
addition, IL-4 also regulates the proliferation and 

10 differentiation of T cells and other hemopoietic cells. 

It thus contributes to the regulation of allergic and 
other immunological reactions. IL-4 binds with high 
affinity to a specific receptor. The cDNA which codes for 
the human IL-4 receptor has been isolated (Idzerda et 

15 al., J. Exp. Med., vol. 171 (1990) 861-873). It is 

evident from analysis of the amino-acid sequence deduced 
from the cDNA sequence that the IL-4 rece k .or is composed 
of a total of 825 amino acids, with the 25 N-terminal 
amino acids acting as signal peptide. Mature human IL-4 

20 receptor is composed of 800 amino acids and, like 

thromboplastin, has a three-domain structure i i) amino- 
terminal extracellular domain (207 amino acids); 
ii) transmembrane region (24 amino acids) and ill) 
cytoplasmic domain (569 amino acids). In the extra- 

25 cellular domain there are six potential sites for 
N-glycosylation (Asn-X-Thr/Ser) . IL-4 receptor has 
homologies with human IL-6 receptor, with the l-subunit 
of human IL-2 receptor, with mouse erythropoietin 
receptor and with rat prolactin receptor ( Idzerda et al . , 

30 loc. cit.). Thus, like thromboplastin, it is not a member 

of the immunoglobulin family but is assigned together 
with the homologous proteins mentioned to the new family 
of hematopoietin receptors. Members of this family hav 
four cyst ine r sidu s and a cons rved sequenc 

35 (Trp-Ser-X-Trp-S r) in the xtrac llular domain located 

near th transmembran r gion in common . 

On the basis of th described function of th IL-4/IL-4 



- 10 - 



2045863 



rec ptor system, th re is a possible therapeutic use of 
a recombinant form of th IL-4 r c ptor for suppressing 
IL-4 -mediated immune reactions (for xampl transplant 
r jection reaction, autoimmune dis as s, all rgic r ac- 
5 tions ) . 

The amount of substance required for therapy makes it 
necessary to prepare such molecules by genetic 
engineering. Because of the straightforward purification 
by affinity chromatography and improved pharmacokinetic 
10 properties, according to the invention the synthesis of 

soluble forms of the IL-4 receptor as immunoglobulin 
fusion protein is particularly advantageous. 

The IL-4 receptor fusion proteins are secreted by mammal- 
ian cells (for example CHO, BHK, COS cells) into the 
15 culture medium, purified by affinity chromatography on 

protein A-Sepharose and have, surprisingly, identical 
functional properties to the extracellular domain of the 
intact membrane-bound IL-4 receptor molecule. 

Construction of a hybrid plasmid pll*— 4RFc coding for IL-4 
20 receptor fusion protein. 

Cutting of the plasmid pCD4E gamma 1 with Xhol and BamHI 
results in an opened vector in which the open Xhol site 
is located downstream from the promoter sequence. The 
open BamHI site is located at the start of the coding 

25 regions for a pentapeptide linker, followed by the hinge 

and the CH2 and CH3 domains of human IgGl. The reading 
frame in the BamHI recognition sequence GGATCC is such 
that GAT is translated as aspartic acid. DMA amplifica- 
tion with thermostable DNA polymerase makes it possible 

30 to modify a given sequence in such a way that any desired 

s qu nces can be attached at on or both ends. Two 
oligonucl otid s abl to hybridiz with sequ nc s in th 
5 '-untranslated r gion 

(As 5' GATC CAGT AC TC GAGAGAGAAGC C GGGC GTGGTGGC TCATGC 3 ' ) or 
35 coding r gion 
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( B : 5 ' CTATGACATGGATCCTGCTCGAAGGGCTCCCTGTAGGAGTTGTG 3 ' ) 
of th IL-4 rec ptor cDKA which is cloned in the vector 
pDC302/T22-8 ( Idzerda t al., loc . cit.) were 
synthesized. Of these, oligonucleotide A is partially 
homologous with the sequence of the coding strand, and 
oligonucleotide B is partially hanologous with the non- 
coding strand; cf. Fig. 5. Amplification using thermo- 
stable DHA polymerase results in a OHA fragment (836 bp) 
which, based on the coding strand, contains at the 5' end 
before the start of the coding sequence an Xhol site, and 
at the 3 ' end before the last codon of the extracellular 
domain a BamHI site. The reading frame in the BaaHI 
cleavage site is such that ligation with the BamHI site 
in pCD4E gamma 1 results in a gene fusion with a reading 
frame continuous from the initiation codon of the IL-4 
receptor cDNA to the stop codon of the heavy chain of 
IgGl. The desired fragment was obtained and, after 
treatment with Xhol and BamHI, ligated into the vector 
pCD4B gamma 1, described above, which had been cut with 
Xhol /BamHI. The resulting plasmld was called pIL4RFc 
(Fig. 6). 

Trana faction of pIL4RFc into mammalian calls 

The fusion protein encoded by the plasmld pIL4RFc is 
called pIL4RFc hereinafter. plL4RFc was transiently 
expressed in COS cells. For this purpose, COS cells w a r e 
trans fee ted with pIL4RFc with the aid of DBAB-dextran 
(BP A 0 325 262). Indirect immunofluorescence investiga- 
tions revealed that the proportion of trana f acted cells 
was about 25 %. 24 h after transfection, the cells were 
transferred into serum- free medium. This cell supernatant 
was harvested after a further three days. 

Purification of XL4RFc fusion protein from cell culture 
supernatants 

500 ml of supernatant from transiently trans f acted COS 
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c lis were coll cted ov might in a batch proc ss in a 
column containing 1.6 ml of protein A-Sepharoa at 4*C, 
wash d with 10 volumes of washing buff r (50 mM tris 
buffer pH 8.6, 150 mM NaCl) and eluted in 0.5 ml frac- 
5 tions with eluting buffer (93:7 100 mM citric acid: 

100 mM sodium citrate) . The first 9 fractions were 
immediately neutralized with 0.1 ml of 2M tris buffer 
pH 8.6 in each case and then combined , and the resulting 
protein was transferred by three concentration/dilution 

10 cycles in an Amicon microconcentrator (Centricon 30) into 

TNE buffer (50 mM tris buffer pH 7.4, 50 mM NaCl, 1 mM 
EDTA) . The IL4RFc obtained in this way is pure by 
SDS-PAGE electrophoresis (U.K. Lammll, Nature 227 (1970) 
680-685). In the absence of reducing agents it behaves in 

15 the SDS-PAGE like a dimer (about 150 KDa) . 

Biological activity of purified H*4RFc 

IL4RFc proteins binds "^I -radiolabeled IL-4 with the same 
affinity (Kq b 0.5 nM) as membrane-bound intact IL-4 recep- 
tor. It inhibits the proliferation of IL-4 -dependent cell 

20 line CTLLHuIL-4RI clone D (Idzerda et al., loc. cit.) in 

concentrations of 10-1000 ng/ml. In addition, it is 
outstandingly suitable for developing IL-4 binding assays 
because it can be bound via its Fc part to mic rot iter 
plates previously coated with, for example, rabbit anti- 

25 human IgG, and in this form likewise binds its ligands 

with high affinity. 

Example 3s Erythropoietin fusion proteins 

Mature erythropoietin (EPO) is a glycoprotein which is 
composed of 166 amino acids and is essential for the 

30 development of erythrocytes. It stimulates the maturation 

and the t rminal diff r ntiation of erythroid precursor 
c lis. Th cDNA for human EPO has been clon d 
(EP-A-0 267 678) and cod s for th 166 amino acids of 
matur EPO and a signal peptide of 22 amino acids which 

35 is ss ntial for a cr tion. The cDNA can be used to 
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pr pare recombinant functional EPO in gen tically 
manipulat d mammalian cells and the EPO can be employed 
clinically for th th rapy of anemic manifestations of 
various etiologies (for example associated with acute 
5 renal failure) . 

Because of the straightforward purification and the 
improved pharmacokinetic properties, according to the 
invention synthesis of EPO as immunoglobulin fusion 
protein is particularly advantageous . 

10 Construction of a hybrid plasmid pEPOFc coding for 

erythropoietin fusion protein. 

This construction was carried out in analogy to that 
described in Example 2 (sections "Construction of a 
hybrid plasmid pIL-4RFc coding for IL-4 receptor fusion 
15 protein"). Two oligonucleotides able to hybridize with 
sequences in the vicinity of the initiation codon 
(A: 5 ' GATCGATCTCGAGATGGGGGTGCACGAAT6TCCTGCCTGGCTGTG6 3') 
and of the stop codon 

(B: 5' CTGGAATCGGATCCCCTGTCCTGCAGGCCTCCCCTGTGTACAGC 3') 

20 of the EPO cDNA cloned in the vector pCES (EP-A 0 267 

678) were synthesized. Of these, oligonucleotide A is 
partially homologous with the sequence of the coding 
strand, and oligonucleotide B is partially homologous 
with the non-coding strand; cf. Fig. 7. Amplification 

25 with thermostable DNA polymerase results in a DNA frag- 
ment (598 bp) which, based on the coding strand, contains 
at the 5' end in front of the initiation codon an Xhol 
site and in which at the 3' end the codon for the 
penultimate C-terminal amino acid residue of the EPO 

30 (Asp) is present in a BamHI recognition sequence. The 

reading frame in the BamHI cleavage site is such that 
ligation with th BamHI site in pCD4E gamma 1 r suits in 
a g n fusion with a reading frame continuous, from th 
initiation codon of EPO cDNA to th stop codon of the 

35 h avy chain of IgGl. Th d sir d fragm nt was obtained 

and, aft r tr atm nt with Xhol and BamHI, ligat d into 
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th v ctor pCD4E gamma 1, described abov , which had be n 
cut with XhoI/BamHI. Th resulting plasmid was call d 
pEPOFc ( Fig . 8 ) . 
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THE EMBODIMENTS OF THE INVENTION IN WHICH AN EXCLUSIVE 
PROPERTY OR PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS x 

1. A solubl fusion prot in compos d of human proteins 
not belonging to the immunoglobulin family, or of 
parts thereof, and of various portions of immuno- 
globulin molecules of all subclasses. 

2. A fusion protein as claimed in claim 1, wherein the 
immunoglobulin portion is the constant part of the 
heavy chain of human IgG. 

3. A fusion protein as claimed in claim 2, wherein the 
immunoglobulin portion is the constant part of the 
heavy chain of human IgGl or a protein A-binding 
fragment thereof. 

4. A fusion protein as claimed in claim 2 or claim 3, 
wherein the fusion takes place at the hinge region. 

5. A fusion protein as claimed in claims 1-4, wherein 
the protein fused to immunoglobulin is the extra- 
cellular portion of a membrane protein or parts 
thereof . 

6. A fusion protein as claimed in claims 1-4, wherein 
the protein fused to immunoglobulin is the extra- 
cellular portion of thromboplastin or parts thereof. 

7. A fusion protein as claimed in claims 1-4, wherein 
the protein fused to immunoglobulin is the extra- 
cellular portion of a cytokine receptor or growth 
factor receptor or parts thereof . 

8. A fusion protein as claimed in claim 7, wherein the 
prot in fused to immunogl bulin is th extracellular 
portion of IL-4 r c ptor or parts th r of. 

9. A fup'on prot in as claimed in claim 7, wh r in th 
prot in fused to immunoglobulin is th xtrac llular 
portion f IL-7 r c ptor or parts ther of. 



20458 

A fusion prot in as claim d in claim 7, wher in th 
prot in fused to immunoglobulin is the xtrac llular 
portion of tumor necrosis factor r c ptor or parts 
thereof . 

A fusion protein as claimed in claim 7, wherein the 
protein fused to immunoglobulin is the extracellular 
portion of G-CSF receptor or parts thereof. 

A fusion protein as claimed in claim 7, wherein the 
protein fused to immunoglobulin is the extracellular 
portion of GM-CSF receptor or parts thereof. 

A fusion protein as claimed in claim 7, wherein the 
protein fused to immunoglobulin is the extracellular 
portion of erythropoietin receptor or parts thereof. 

A fusion protein as claimed in claims 1-4, wherein 
the protein fused to Immunoglobulin is a non- 
membrane-bound soluble protein or part thereof. 

A fusion protein as claimed in claim 14, wherein the 
protein fused to Immunoglobulin is a cytokine or 
growth factor or part thereof. 

i 

A fusion protein as claimed in claim 15, wherein the 
protein fused to immunoglobulin is erythropoietin or 
part thereof. 

A fusion protein as claimed in claim 15, wherein the 
protein fused to immunoglobulin is GM-CSF or G-CSF 
or part thereof. 



A fusion protein as claimed in claim 15, wher in th 
prot in fused to immunoglobulin is interl ukin IL-1 
to IL-8 or part th reof. 
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19. A fusion prot in as claim d in any of prec ding 
claims 1-18, wh r in a factor Xa cleavag sit is 
additionally ins rted bet we n th immunoglobulin 
part and the non- Immunoglobulin part. 

5 20. A process for preparing fusion proteins as claimed 

in any of claims 1-19/ which comprises introducing 
the DHA coding for these constructs into a mammalian 
cell expression system and, after expression, 
purifying the produced fusion protein by affinity 
10 chromatography via the immunoglobulin portion. 

21. The use of the fusion proteins as claimed in any of 
claims 1-19 for diagnosis. 

22. The use of the fusion proteins as claimed in any of 
claims 1-19 for therapy. 
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3. The fusion protein as claimed in claim 1 and 

ubstantially as described h rein. 



GTCGCTCGGACGCTCCTGCTCGGCTGGGTCTTCGCCCAGGTGGCCGGCGCTTCAGGCACT 
121 + — ------+ + — jQo 

CAGCGAGCCTGCGAGGACGAGCCGACCCAGAAGCGGGTCCACCGGCCGCGAAGTCCGTGA 

<*************************** 

Oligonucleotide 1 

ACAAATACTGTGGCAGCATATAATTTAACTTGGAAATCAACTMTTTCAAGACAATTTTG 
181 + + + + + 240 

TGTTTATGACACCGTCGTATATTAAATTGAACCTTTAGTTGATTAAAGTTCTGTTAAAAC 
**************************** j 



Oligonucleotide 2 
*********************************** M> 

AACTACTGTTTCAGTGTTCAAGCAGTGATTCCCTCCCGAACAGTTAACCGGAAGAGTACA 
7 21 + + + + + + 780 

TTGATGACAAAGTCACAAGTTCGTCACTAAGGGAGGGCTTGTCAATTGGCCTTCTCATGT 



10 30 50 

GCCCCCCCTCGAGGTCGACGGTATCGATAAGCTTGATATCGAATTCTCTCGGCGAACCCC 

70 90 110 

CTCGCACTCCCTCTGGCCGGCCCAGGGCGCCTTCAGCCCAACCTCCCCAGCCCCACGGGC 

130 150 170 

GCCACGGAACCCGCTCGATCTCGCCGCCAACTGGTAGACATGGAGACCCCTGCCTGGCCC 

MetGl uThrProAl aTrpPro 

190 210 230 

CGGGTCCCGCGCCCCGAGACCGCCGTCGCTCGGACGCTCCTGCTCGGCTGGGTCTTCGCC 
ArgVal ProArgProGl uThrAl aVal Al aArgThrLeuLeuLeuGlyTrpVal PheAl a 

250 270 290 

CAGGTGGCCGGCGCTTCAGGCACTACAAATACTGTGGCAGCATATAATTTAACTTGGAAA 
Gl nVal Al aGlyAl aSerGlyThrThrAsnThrVal Al aAl aTyrAsnLeuThrTrpLys 

310 330 350 

TCAACTAATTTCAAGACAATTTTGGAGTGGGAACCCAAACCC6TCAATCAAGTCTACACT 
SerThrAsnPheLysThrll eLeuGl uTrpGl uProLysProVal AsnGl nValTyrThr 

370 390 410 

GTTCAAATAAGCACTAAGTCAGGAGATTGGAAAAGCAAATGCTTTTACACAACAGACACA 
Val Gl n II eSerThrLysSerGlyAspTrpLysSerLysCysPheTyrThrThrAspThr 

430 450 470 

GAGTGTGACCTCACCGACGAGATTGTGAAGGATGTGAAGCAGACGTACTTGGCACGGGTC 
Gl uCysAspLeuThrAspGl ull eVal LysAspVal LysGl nThrTyrLeuAl aArgVal 

490 510 530 

TTCTCCTACCCGGCAGGGAATGTGGAGAGCACCGGTTCTGCTGGGGAGCCTCTGTATGAG 
PheSerTyrProAl aGI yAsnVal Gl uSerThrGl ySerAl aGlyGl uProLeuTyrGl u 

550 570 590 

AACTCCCCAGAGTTCACACCTTACCTGGAGACAAACCTCGGACAGCCAACAATTCAGAGT 
AsnSerProGluPheThrProTyrLeuGluThrAsnLeuGlyGlnProThrlleGlnSer 




/f(ja-l /Vr. /,'/' ; Par. 
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610 630 650 

TTTGAACAGGTGGGAACAAAAGTGAATGTGACCGTAGAAGATGAACGGACTTTAGTCAGA 
PheGl uGl nVal GlyThrLysVal AsnValThrVal Gl uAspGl uArgThrLeuVal Arg 

670 690 710 

AGGMCAACACTTTCCTAAGCCTCCGGGATGTTTTTGGCAAGGACTTAATTTATACACTT 
ArgAsnAsnThrPheLeuSerLeuArgAspVal PheGlyLysAspLeu II eTyrThrLeu 

730 750 770 

TATTATTGGAAATCTTCAAGTTCAGGAAAGAAAACAGCCAAAACAAACACTAAT6AGTTT 
TyrTyrTrpLysSerSerSerSerGlyLysLysThrAl aLysThrAsnThrAsnGl uPhe 

790 810 830 

TTGATTGATGTGGATAAAG6AGAAAACTACTGTTTCAGTGTTCAAGCAGTGATTCCCTCC 
LeuIleAspValAspLysGlyGluAsnTyrCysPheSerValGlnAl aVal IleProSer 

850 870 890 

CGAACAGTTAACCGGAAGAGTACAGACAGCCCGGTAGAGTGTATGGGCCAGGAGAAAGGG 
ArgThrValAsnArgLysSerThrAspSerProValGl uCysMetGlyGTnGluLysGly 

910 930 950 

GAATTCAGAGAAATATTCTACATCAT7GGAGCTGTGGTATTTGTGGTCATCATCCTTGTC 
GluPheArgGluIlePheTyrllelleGlyAl aValValPheValVal IlelleLeuVal 

970 990 1010 

ATCATCCTGGCTATATCTCTACACAAGTGTAGAAAGGCAGGAGTGGGGCAGAGCTGGAAG 
IlelleLeuAlalleSerLeuHisLysCysArgLysAI aGlyValGlyGlnSerTrpLys 

1030 1050 1070 

GAGAACTCCCCACTGAATGTTTCATAAAGGAAGCACTGTTGGAGCTACTGCAAATGCTAT 
Gl uAsnSerProLeuAsnVal Ser 

1090 1110 1130 

ATTGCACTGTGACCGAGAACTTTTAAGAGGATAGAATACATGGAAACGCAAATGAGTATT 

1150 1170 1190 

TCGGAGCATGAAGACCCTGGAGTTCAAAAAACTCTTGATATGACCTGTTATTACCATTAG 



_Jhig: 2 (contj 



1210 1230 1250 

CATTCTGGTTTTGACATCAGCATTAGTCACTTTGAAAT6TAACGAATGGTACTACAACCA 

1270 1290 1310 

ATTCCAAGTTTTAATTTTTMCACCArGGCACCTTTTGCACATAACATGCTTTAGATTAT 

1330 1350 1370 

ATATTCCGCACTTAAGGATTAACCAGGTCGTCCAAGCAAAAACAAATGGGAAAATGTCTT 

1390 1410 1430 

AAAAAATCCTGG6TGGACTTTT6AAAA6C 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I TGAGACG6A6TC 

1450 1470 1490 

TTGCTCTGTTGCCCAGGCTGGAGTGCAGTAGCACGATCTCGGCTCACTTGCACCCTCCGT 

1510 1530 1550 

CTCTCGGGTTCAAGCAATTGTCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGTGCGC 

1570 1590 1610 

ACTAC C AC GC C AAGCTAATTTTTGT ATTTTTTAGTAGAG ATGGGGTTTC ACC ATCTTGGC 

1630 1650 1670 

CAGGCTGGTCTTGAATTCCTGACCTCAGTGATCCACCCACCTTGGCCTCCCAAAGATGCT 

1690 1710 1730 

AGTATTATGGGCGTGAACCACCATGCCCAGCCGAAAAGCTTTTGAGGGGCTGACTTCAAT 

1750 1770 1790 

CCATGTAGGAMGTAAAATGGAAGGAAATTGGGTGCATTTCTAGGACTTTTCTAACATAT 

1810 1830 1850 

GTCTATAATATAGTGTTTAGGTTCTTTTTTTTTTCAGGAATACATTTGGAAATTCAAAAC 

1870 1890 1910 

AATTGGGCAAACTTTGTATTAATGTGTTAAGTGCAGGAGACATTGGTATTCTGGGCAGCT 
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B*in 2 Z (cont) 

1930 1950 1970 

TCCTAATATGCTTTACAATCTGCACTTTAACTGACTTAAGTGGCATTAAACATTTGAGAG 

1990 2010 2030 

CTAACTATATTTTTATAA6ACTACTATACAAACTACAGAGTTTATGATTTAAGGTACTTA 

2050 2070 2090 

AAGCTTCTATGGTTGACATTGTATATATAATTTTTTAAAAAGGTTTTTCTATATGGGGAT 

2110 2130 2150 

TTTCTATTTATGTAGGTAATATTGTTCTATTTGTATATATTGAGATAATTTATTTAATAT 

2170 

ACTTTAAATAAAGGTGACTGGGAATTGTT 
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Hindlll 

5' GATCGATTAAGCTTCGGAACCCGCTCGATCTCGCCGCC 3' Oligonucleotide A 

Illlllllllllllllilllllll 

AGCCCCACGGGCGCCACGGAACCCGCTCGATCTCGCCGCCAACTGGTAGACATGGAG 
110 * * ♦ + + 167 

TCGGGGTGCCCGCGGTGCCTTGGGCGAGCTAGAGCGGCGGTTGACCATCTGTACCTC 

MetGlu 

I- 

5 • -untranslated Start 

Reading frame 
(signal peptide) 



End of extracellular domain | Start of transmembrane region 

I ~ 

Gl nGl uLysGlyGI uPheArgGl u 1 1 ePheTyrll ell eGl yAl aVal 

CAGGAGAAAGGGGAATTCAGAGAAATATTCTACATCATTGGAGCTGTGGT 
890 ♦ ♦ + , 940 

GTCCTCTTTCCCCTTAAGTCTCTTTATAAGATGTAGTAACCTCGACACCA 
Illlllllllllllllilllllll 
3' CCCCTTAAGTCTCTTTATAAGATGCCCCTAGGTCTATACG 5' Oligonucleotide B 

BamHI 



\ 
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Xhol 

5' GATCCA6TACTCGAGAGAGAAGCCGGGCGTGGTGGCTCATGC 3' Oligonucleotide A 

1 1 1 1 J 1 1 1 1 1 1 r 1 1 1 1 [ 1 1 1 1 1 1 ! 1 1 } 

AGAGAAGCCGGGCGTGGTGGCTCATGCCTATAATCCCAGCACTTTTGGAGGCTGAGGCGG 

61 + ---+ + ♦ ♦ + 120 

TCTCTTCGGCCCGCACCACCGAGTACGGATATTAGGGTCGTGAAAACCTCCGACTCCGCC 



5 • -untranslated 



GCAGATCACTTGAGATCAGGAGTTCGAGACCAGCCTGGTGCCTTGGCATCTCCCAATGGG 

121 + + ♦ ♦ + + 180 

CGTCTAGTGAACTCTAGTCCTCAAGCTCTGGTCGGACCACGGAACCGTAGAGGGTTACCC 



•5 ' -untranslated |MetGly 



Start 

Reading frame (signal peptide) 



End of extracellular domain | Start of transmembrane region 



H1 sAsnSerTyrArgGI uProPheGl uGl nH 1 sleuLeuLeuGlyVal SerVal SerCy s 
CACAACTCCTACAGGGAGCCCTTCGAGCAGCACCTCCTGCTGGGCGTCAGCGTT7CCTGC 
839 -+ ♦ + -f ♦ 898 

GTGTTGAGGATGTCCCTCGGGAAGCTCGTCGTGGAGGACGACCCGCAGTCGCAAAGGACG 

IIIIIIIIIIIIIIIIIIIMIIIIIIIII 

3' GTGTTGAGGATGTCCCTCGGGAAGCTCGTCCTAGGTACAGTATC 5' Oligonucleotide B 



BamHI 
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Xhol 

5' GATCGATCTCGAGATGGGGGTGCACGAATGTCCTGCCTGGCTGTGG 3' Oligonucleotide A 

IIIIIIIIIIKIIIIIIIIIIIIIIIIIIIII 

ATGGGGGTGCACGAATGTCCTGCCTGGCTGTGGCTTCTCCTGTCCCTGCTGTCG 
182 + + + + + 235 

TACCCCCACGTGCTTACAGGACGGACCGACACCGAAGAGGACAGGGACGACAGC 

MetGlyValHisGluCysProAlaTrpLeuTrpLeuLeuLeuSerLeuLeuSer - 
Start reading frame (signal peptide) 



End of reading frame 1 

LeuTyrThrGlyGluAl aCysArgThrGl yAspArgEnd 

I 

GCTGTACACAGGGGAGGCCTGCAGGACAGGGGACAGATGACCAGGTGTGTCCACCTGGGC 
724 ♦ ♦ + + + 783 

CGACATGTGTCCCCTCCGGACGTCCTGTCCCCTGTCTACTGGTCCACACAGGTGGACCCG 
lllllllllllllllllllllllllllllllll 
3' CGACATGTGTCCCCTCCGGACGTCCTGTCCCCTAGGCTAAGGTC 5' Oligonucleotide B 

BamHI 
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